FILE 'HOME' ENTERED AT 11:30:52 ON 20 MAY 2004



=> file BIOSCIENCE 

FILE 'DRUGMONOG' ACCESS NOT AUTHORIZED

FILE 'ADISCTI' ENTERED AT 11:31:02 ON 20 MAY 2004
COPYRIGHT (c) 2004 Adis Data Information BV

FILE 'ADISINSIGHT' ENTERED AT 11:31:02 ON 20 MAY 2004
COPYRIGHT (C) 2004 Adis Data Information BV

FILE 'ADISNEWS' ENTERED AT 11:31:02 ON 20 MAY 2004
COPYRIGHT (C) 2004 Adis Data Information BV

FILE 'AGRICOLA' ENTERED AT 11:31:02 ON 20 MAY 2004

FILE 'ANABSTR' ENTERED AT 11:31:02 ON 20 MAY 2004
COPYRIGHT (c) 2004 THE ROYAL SOCIETY OF CHEMISTRY (RSC)

FILE 'AQUASCI' ENTERED AT 11:31:02 ON 20 MAY 2004
COPYRIGHT 2004 FAO (On behalf of the ASFA Advisory Board). All rights reserved.

FILE 'BIOBUSINESS' ENTERED AT 11:31:02 ON 20 MAY 2004
COPYRIGHT (c) 2004 Biological Abstracts, Inc. (BIOSIS)

FILE 'BIOCOMMERCE' ENTERED AT 11:31:02 ON 20 MAY 2004
COPYRIGHT (C) 2004 BioCommerce Data Ltd. Richmond Surrey, United Kingdom. All rights reserved

FILE 'BIOSIS' ENTERED AT 11:31:02 ON 20 MAY 2004
COPYRIGHT (C) 2004 BIOLOGICAL ABSTRACTS INC.(R)

FILE 'BIOTECHABS' ACCESS NOT AUTHORIZED

FILE 'BIOTECHDS' ENTERED AT 11:31:02 ON 20 MAY 2004
COPYRIGHT (C) 2004 THOMSON DERWENT AND INSTITUTE FOR SCIENTIFIC INFORMATION

FILE 'BIOTECHNO' ENTERED AT 11:31:02 ON 20 MAY 2004
COPYRIGHT (C) 2004 Elsevier Science B.V., Amsterdam. All rights reserved.

FILE 'CABA' ENTERED AT 11:31:02 ON 20 MAY 2004
COPYRIGHT (C) 2004 CAB INTERNATIONAL (CABI)

FILE 'CANCERLIT' ENTERED AT 11:31:02 ON 20 MAY 2004

FILE 'CAPLUS' ENTERED AT 11:31:02 ON 20 MAY 2004 

USE IS SUBJECT TO THE TERMS OF YOUR STN CUSTOMER AGREEMENT. 

PLEASE SEE "HELP USAGETERMS" FOR DETAILS. 

COPYRIGHT (C) 2004 AMERICAN CHEMICAL SOCIETY (ACS) 

FILE 'CEABA-VTB' ENTERED AT 11:31:02 ON 20 MAY 2004
COPYRIGHT (c) 2004 DECHEMA eV

FILE 'CEN' ENTERED AT 11:31:02 ON 20 MAY 2004
COPYRIGHT (C) 2004 American Chemical Society (ACS)

FILE 'CIN' ENTERED AT 11:31:02 ON 20 MAY 2004
USE IS SUBJECT TO THE TERMS OF YOUR STN CUSTOMER AGREEMENT.
PLEASE SEE "HELP USAGETERMS" FOR DETAILS.
COPYRIGHT (C) 2004 American Chemical Society (ACS)

FILE 'CONFSCI' ENTERED AT 11:31:02 ON 20 MAY 2004 
COPYRIGHT (C) 2004 Cambridge Scientific Abstracts (CSA) 

FILE 'CROPB' ENTERED AT 11:31:02 ON 20 MAY 2004 
COPYRIGHT (C) 2004 THOMSON DERWENT 

FILE 'CROPU' ENTERED AT 11:31:02 ON 20 MAY 2004 
COPYRIGHT (C) 2004 THOMSON DERWENT 



FILE 'DISSABS' ENTERED AT 11:31:02 ON 20 MAY 2004 

COPYRIGHT (c) 2004 ProQuest Information and Learning Company; All Rights Reserved. 



FILE 'DDFB' ACCESS NOT AUTHORIZED

FILE 'DDFU' ACCESS NOT AUTHORIZED

FILE 'DGENE' ENTERED AT 11:31:02 ON 20 MAY 2004
COPYRIGHT (C) 2004 THOMSON DERWENT

FILE 'DRUGB' ENTERED AT 11:31:02 ON 20 MAY 2004
COPYRIGHT (C) 2004 THOMSON DERWENT

FILE 'DRUGMONOG2' ENTERED AT 11:31:02 ON 20 MAY 2004
COPYRIGHT (C) 2004 IMSWORLD Publications Ltd

FILE 'IMSDRUGNEWS' ENTERED AT 11:31:02 ON 20 MAY 2004
COPYRIGHT (C) 2004 IMSWORLD Publications Ltd

FILE 'DRUGU' ENTERED AT 11:31:02 ON 20 MAY 2004
COPYRIGHT (C) 2004 THOMSON DERWENT

FILE 'IMSRESEARCH' ENTERED AT 11:31:02 ON 20 MAY 2004
COPYRIGHT (C) 2004 IMSWORLD Publications Ltd

FILE 'EMBAL' ENTERED AT 11:31:02 ON 20 MAY 2004
COPYRIGHT (c) 2004 Elsevier Inc. All rights reserved. 

FILE 'EMBASE' ENTERED AT 11:31:02 ON 20 MAY 2004 
COPYRIGHT (c) 2004 Elsevier Inc. All rights reserved. 

FILE 'ESBIOBASE' ENTERED AT 11:31:02 ON 20 MAY 2004
COPYRIGHT (c) 2004 Elsevier Science B.V., Amsterdam. All rights reserved.

FILE 'FEDRIP' ENTERED AT 11:31:02 ON 20 MAY 2004

FILE 'FOMAD' ENTERED AT 11:31:02 ON 20 MAY 2004
COPYRIGHT (c) 2004 Leatherhead Food Research Association 

FILE 'FOREGE' ENTERED AT 11:31:02 ON 20 MAY 2004 
COPYRIGHT (c) 2004 Leatherhead Food Research Association 

FILE 'FROSTI' ENTERED AT 11:31:02 ON 20 MAY 2004 
COPYRIGHT (C) 2004 Leatherhead Food Research Association 

FILE 'FSTA' ENTERED AT 11:31:02 ON 20 MAY 2004
COPYRIGHT (C) 2004 International Food Information Service

FILE 'GENBANK' ENTERED AT 11:31:02 ON 20 MAY 2004

FILE 'HEALSAFE' ENTERED AT 11:31:02 ON 20 MAY 2004
COPYRIGHT (C) 2004 Cambridge Scientific Abstracts (CSA)

FILE 'IFIPAT' ENTERED AT 11:31:02 ON 20 MAY 2004
COPYRIGHT (C) 2004 IFI CLAIMS (R) Patent Services (IFI)

FILE 'IMSPRODUCT' ENTERED AT 11:31:02 ON 20 MAY 2004
COPYRIGHT (C) 2004 IMSWORLD Publications Ltd

FILE 'JICST-EPLUS' ENTERED AT 11:31:02 ON 20 MAY 2004
COPYRIGHT (C) 2004 Japan Science and Technology Agency (JST)

FILE 'KOSMET' ENTERED AT 11:31:02 ON 20 MAY 2004 

COPYRIGHT (c) 2004 International Federation of the Societies of Cosmetics Chemists

FILE 'LIFESCI' ENTERED AT 11:31:02 ON 20 MAY 2004 
COPYRIGHT (C) 2004 Cambridge Scientific Abstracts (CSA) 

FILE 'MEDICONF' ENTERED AT 11:31:02 ON 20 MAY 2004
COPYRIGHT (c) 2004 FAIRBASE Datenbank GmbH, Hannover, Germany

FILE 'MEDLINE' ENTERED AT 11:31:02 ON 20 MAY 2004 

FILE 'NIOSHTIC' ENTERED AT 11:31:02 ON 20 MAY 2004 

COPYRIGHT (C) 2004 U.S. Secretary of Commerce on Behalf of the U.S. Government

FILE 'NTIS' ENTERED AT 11:31:02 ON 20 MAY 2004
Compiled and distributed by the NTIS, U.S. Department of Commerce.
It contains copyrighted material.
All rights reserved. (2004)

FILE 'NUTRACEUT' ENTERED AT 11:31:02 ON 20 MAY 2004 

Copyright 2004 (c) MARKETLETTER Publications Ltd. All rights reserved. 

FILE 'OCEAN' ENTERED AT 11:31:02 ON 20 MAY 2004 
COPYRIGHT (C) 2004 Cambridge Scientific Abstracts (CSA)

FILE 'PASCAL' ENTERED AT 11:31:02 ON 20 MAY 2004 

Any reproduction or dissemination in part or in full, 

by means of any process and on any support whatsoever 

is prohibited without the prior written agreement of INIST-CNRS. 

COPYRIGHT (C) 2004 INIST-CNRS. All rights reserved. 

FILE 'PCTGEN' ENTERED AT 11:31:02 ON 20 MAY 2004
COPYRIGHT (C) 2004 WIPO

FILE 'PHAR' ENTERED AT 11:31:02 ON 20 MAY 2004
COPYRIGHT (C) 2004 PJB Publications Ltd. (PJB)

FILE 'PHARMAML' ENTERED AT 11:31:02 ON 20 MAY 2004
Copyright 2004 (c) MARKETLETTER Publications Ltd. All rights reserved.

FILE 'PHIC' ENTERED AT 11:31:02 ON 20 MAY 2004
COPYRIGHT (C) 2004 PJB Publications Ltd. (PJB)

FILE 'PHIN' ENTERED AT 11:31:02 ON 20 MAY 2004
COPYRIGHT (C) 2004 PJB Publications Ltd. (PJB)

FILE 'PROMT' ENTERED AT 11:31:02 ON 20 MAY 2004 
COPYRIGHT (C) 2004 Gale Group. All rights reserved. 

FILE 'RDISCLOSURE' ENTERED AT 11:31:02 ON 20 MAY 2004 
COPYRIGHT (c) 2004 Kenneth Mason Publications Ltd. 

FILE 'SCISEARCH' ENTERED AT 11:31:02 ON 20 MAY 2004 
COPYRIGHT 2004 THOMSON ISI 

FILE 'SYNTHLINE' ENTERED AT 11:31:02 ON 20 MAY 2004
COPYRIGHT (c) 2004 Prous Science

FILE 'TOXCENTER' ENTERED AT 11:31:02 ON 20 MAY 2004
COPYRIGHT (C) 2004 ACS

FILE 'USPATFULL' ENTERED AT 11:31:02 ON 20 MAY 2004
CA INDEXING COPYRIGHT (C) 2004 AMERICAN CHEMICAL SOCIETY (ACS)

FILE 'USPAT2' ENTERED AT 11:31:02 ON 20 MAY 2004
CA INDEXING COPYRIGHT (C) 2004 AMERICAN CHEMICAL SOCIETY (ACS)

FILE 'VETB' ENTERED AT 11:31:02 ON 20 MAY 2004
COPYRIGHT (C) 2004 THOMSON DERWENT 

FILE 'VETU' ENTERED AT 11:31:02 ON 20 MAY 2004 
COPYRIGHT (C) 2004 THOMSON DERWENT 

FILE 'WPIDS' ENTERED AT 11:31:02 ON 20 MAY 2004 
COPYRIGHT (C) 2004 THOMSON DERWENT 

FILE 'WPIFV' ENTERED AT 11:31:02 ON 20 MAY 2004 
COPYRIGHT (C) 2004 THOMSON DERWENT 

FILE 'WPINDEX' ACCESS NOT AUTHORIZED 

=> S MEGF10

53 FILES SEARCHED...
L1 16 MEGF10

=> DUP REM L1

DUPLICATE IS NOT AVAILABLE IN 'ADISINSIGHT, ADISNEWS, BIOCOMMERCE, DGENE,
DRUGMONOG2, IMSRESEARCH, FEDRIP, FOREGE, GENBANK, IMSPRODUCT, KOSMET,
MEDICONF, NUTRACEUT, PCTGEN, PHAR, PHARMAML, RDISCLOSURE, SYNTHLINE'.
ANSWERS FROM THESE FILES WILL BE CONSIDERED UNIQUE
PROCESSING COMPLETED FOR L1

L2 13 DUP REM L1 (3 DUPLICATES REMOVED)

=> D L2 1-13



L2 ANSWER 1 OF 13 USPATFULL on STN
AN 2004:7325 USPATFULL
TI Proteins, polynucleotides encoding them and methods of using the same
IN Anderson, David W., Branford, CT, UNITED STATES
   Burgess, Catherine E., Wethersfield, CT, UNITED STATES
   Casman, Stacie J., North Haven, CT, UNITED STATES
   Colman, Steven D., Guilford, CT, UNITED STATES
   Edinger, Shlomit R., New Haven, CT, UNITED STATES
   Ellerman, Karen, Branford, CT, UNITED STATES
   Gerlach, Valerie, Branford, CT, UNITED STATES
   Gunther, Erik, Branford, CT, UNITED STATES
   Kekuda, Ramesh, Stamford, CT, UNITED STATES
   MacDougall, John R., Hamden, CT, UNITED STATES
   Mehraban, Fuad, Trumbull, CT, UNITED STATES
   Patturajan, Meera, Branford, CT, UNITED STATES
   Rothenberg, Mark, Clinton, CT, UNITED STATES
   Shimkets, Richard A., Guilford, CT, UNITED STATES
   Smithson, Glennda, Guilford, CT, UNITED STATES
   Spytek, Kimberly A., New Haven, CT, UNITED STATES
   Stone, David J., Guilford, CT, UNITED STATES
   Vernet, Corine A.M., Branford, CT, UNITED STATES
   Zerhusen, Bryan D., Branford, CT, UNITED STATES
PI US 2004005558 A1 20040108
AI US 2002-52648 A1 20020118 (10)
PRAI US 2001-262454P 20010118 (60)
   US 2001-272920P 20010302 (60)
   US 2001-284549P 20010418 (60)
   US 2001-303229P 20010705 (60)
   US 2001-262892P 20010119 (60)
   US 2001-263605P 20010123 (60)
   US 2001-269098P 20010215 (60)
   US 2001-264159P 20010125 (60)
   US 2001-265517P 20010131 (60)
   US 2001-271855P 20010227 (60)
   US 2001-267057P 20010207 (60)
   US 2001-286287P 20010425 (60)
DT Utility
FS APPLICATION
LN.CNT 10349
INCL INCLM: 435/006.000
   INCLS: 435/007.200; 435/069.100; 435/320.100; 435/325.000; 514/012.000;
   514/044.000; 530/350.000; 530/388.100
NCL NCLM: 435/006.000
   NCLS: 435/007.200; 435/069.100; 435/320.100; 435/325.000; 514/012.000;
   514/044.000; 530/350.000; 530/388.100
IC [7]
   ICM: C12Q001-68
   ICS: G01N033-53; G01N033-567; A61K038-17; A61K048-00; C12P021-02;
   C12N005-06; C07K014-47; C07K016-18
CAS INDEXING IS AVAILABLE FOR THIS PATENT.

L2 ANSWER 2 OF 13 CAPLUS COPYRIGHT 2004 ACS on STN DUPLICATE 1 
AN 2001:330391 CAPLUS 
DN 136:64933 

TI Prediction of the coding sequences of unidentified human genes. XX. The 
complete sequences of 100 new cDNA clones from brain which code for large 
proteins in vitro 

AU Nagase, Takahiro; Nakayama, Manabu; Nakajima, Daisuke; Kikuno, Reiko; 
Ohara, Osamu 

CS Kazusa DNA Research Institute, Chiba, 292-0812, Japan 
SO DNA Research (2001), 8(2), 85-95 

CODEN: DARSE8; ISSN: 1340-2838 
PB Universal Academy Press 
DT Journal 
LA English 

RE.CNT 28 THERE ARE 28 CITED REFERENCES AVAILABLE FOR THIS RECORD 
ALL CITATIONS AVAILABLE IN THE RE FORMAT 

L2 ANSWER 3 OF 13 LIFESCI COPYRIGHT 2004 CSA on STN 
AN 2004:35501 LIFESCI 

TI Prediction of the Coding Sequences of Unidentified Human Genes. XX. The 
Complete Sequences of 100 New cDNA clones from Brain Which Code for Large 
Proteins in vitro 

AU Nagase, T.; Nakayama, M.; Nakajima, D.; Kikuno, R.; Ohara, O.



CS Kazusa DNA Research Institute, 1532-3 Yana, Kisarazu, Chiba 292-0812, 
Japan 

SO DNA Research [DNA Res.], (20010000) vol. 8, [np] . 

ISSN: 1340-2838. 
DT Journal 
FS N 

LA English 
SL English 

L2 ANSWER 4 OF 13 DGENE COPYRIGHT 2004 THOMSON DERWENT on STN 
AN AAL52376 DNA DGENE 

TI New epidermal growth factor producing Lactococcus lactis or Lactococcus 
casei , useful for promoting gut adsorption or for treating Short Bowel 
Syndrome. 

IN Steidler L 

PA (VLAA-N) VLAAMS INTERUNIVERSITAIR INST BIOTECHNOG. 

(UYGE-N) UNIV GENT. 
PI WO 2004001020 A2 20031231 
AI WO 2003-EP50242 20030619 
PRAI EP 2002-77532 20020619 
DT Patent 
LA English 
OS 2004-071734 [07] 

DESC Murine epidermal growth factor coding sequence oligo ***megf10***

L2 ANSWER 5 OF 13 GENBANK.RTM. COPYRIGHT 2004 on STN

LOCUS (LOC):             CK656927 GenBank (R)
GenBank ACC. NO. (GBN):  CK656927
GenBank VERSION (VER):   CK656927.1 GI:44893777
CAS REGISTRY NO. (RN):   660084-00-4
SEQUENCE LENGTH (SQL):   483
MOLECULE TYPE (CI):      mRNA; linear
DIVISION CODE (CI):      Expressed sequence tag
DATE (DATE):             3 Mar 2004
DEFINITION (DEF):        3017HFEX25F2 BgORESTES schistosome-exposed NHM 3017
                         Head/foot 2 Biomphalaria glabrata cDNA clone ZBA0961
                         similar to ***MEGF10*** protein, mRNA sequence.
KEYWORDS (ST):           EST
SOURCE:                  Biomphalaria glabrata (bloodfluke planorb)
ORGANISM (ORGN):         Biomphalaria glabrata
                         Eukaryota; Metazoa; Mollusca; Gastropoda; Pulmonata;
                         Basommatophora; Lymnaeoidea; Planorbidae; Biomphalaria
COMMENT:
  Contact: Lockyer, A.E.
  Wolfson Wellcome Biomedical Laboratory
  The Natural History Museum
  Cromwell Road, London, SW7 5BD, UK
  Tel: +44 (0)20 7942 5148
  Fax: +44 (0)20 7942 5518
  Email: a.lockyer@nhm.ac.uk
  High quality sequence stop: 483
  POLYA=NO.
REFERENCE:               1 (bases 1 to 483)
AUTHOR (AU):             Lockyer,A.E.; Spinks,J.N.; Kane,R.A.; Dias Neto,E.;
                         Noble,L.R.; Rollinson,D.; Jones,C.S.
TITLE (TI):              ESTs from Biomphalaria glabrata using the ORESTES
                         approach
JOURNAL (SO):            Unpublished (2003)

FEATURES (FEAT):
Feature Key   Location   Qualifier
source        1..483     /organism="Biomphalaria glabrata"
                         /mol-type="mRNA"
                         /strain="NHM 3017"
                         /db-xref="taxon:6526"
                         /clone="ZBA0961"
                         /sex="Hermaphrodite"
                         /tissue-type="Head/foot"
                         /clone-lib="BgORESTES schistosome-exposed NHM 3017
                         Head/foot 2"
                         /note="Vector: pGEM; mRNA, extracted from
                         schistosome-exposed 3017 (resistant) snails was used
                         as a template for RT-PCR with random primers to
                         generate cDNA fragments. These were cloned and
                         sequenced using M13F. Primer sequences were removed
                         from sequence."

SEQUENCE (SEQ):
1 ggtcagaact gttccaatgt ctgccaatgt gatgtcagta attctctaaa ctgtaatgat
61 gtcgatggta catgcacttg taaaactggc tggactggaa ccaactgtga acaggacata
121 gatgagtgtg ctattaattt aacattctgc tcaaatcttc atgaagtttg ccgcaatgtg
181 aagggctctg cggaatgtat ttgtcaagat ggtttctaca attcaacaat tttattcagt
241 tgtcaagcct gtgactctct ccactatgga tcaaactgca cctctgtatg ctcatgtcat
301 acaaccaaca cagcagattg taatgatgtc aatggaacat gttcttgtaa acctggatgg
361 actggggctg attgctcgca aggttgtgat tccctgcact atggtcaaga ctgcaccttg
421 caatgtaaat gtcagttaaa taacagcgct gtatgcaata gcactgatgg acactgcaca
481 tgt


L2 ANSWER 6 OF 13 GENBANK.RTM. COPYRIGHT 2004 on STN

LOCUS (LOC):             CD648703 GenBank (R)
GenBank ACC. NO. (GBN):  CD648703
GenBank VERSION (VER):   CD648703.1 GI:31905003
CAS REGISTRY NO. (RN):   537522-34-2
SEQUENCE LENGTH (SQL):   801
MOLECULE TYPE (CI):      mRNA; linear
DIVISION CODE (CI):      Expressed sequence tag
DATE (DATE):             18 Jun 2003
DEFINITION (DEF):        AUF_103_I07_T7 Crassostrea virginica Gill Crassostrea
                         virginica cDNA 5' similar to ***MEGF10*** protein,
                         mRNA sequence.
KEYWORDS (ST):           EST
SOURCE:                  Crassostrea virginica (eastern oyster)
ORGANISM (ORGN):         Crassostrea virginica
                         Eukaryota; Metazoa; Mollusca; Bivalvia; Pteriomorphia;
                         Ostreoida; Ostreoidea; Ostreidae; Crassostrea
NUCLEIC ACID COUNT (NA): 253 a 130 c 157 g 261 t
COMMENT:
  Contact: Liu ZJ
  The Fish Molecular Genetics and Biotechnology Laboratory,
  Department of Fisheries and Allied Aquacultures and Program of Cell
  and Molecular Biosciences
  Auburn University
  203 Swingle Hall, Auburn University, Auburn, AL 36849, USA
  Tel: 334 844 4054
  Fax: 334 844 9208
  Email: zliu@acesag.auburn.edu
  Seq primer: M13 Reverse.
REFERENCE:               1 (bases 1 to 801)
AUTHOR (AU):             Peatman,E.; Kucuktas,H.; Li,P.; He,C.; Feng,J.; Wei,X.;
                         Liu,Z.
TITLE (TI):              Differentially expressed oyster (Crassostrea virginica)
                         genes after exposure to mercury
JOURNAL (SO):            Unpublished

FEATURES (FEAT):
Feature Key   Location   Qualifier
source        1..801     /organism="Crassostrea virginica"
                         /mol-type="mRNA"
                         /db-xref="taxon:6565"
                         /clone-lib="Crassostrea virginica Gill"
                         /note="organ: Gill; Vector: pSPORT1; Site-1: NotI;
                         Site-2: SalI"

SEQUENCE (SEQ):
1 gtttttgcct agaagcggat gggtaatcgg gtacattttc tgaactggtg gtgataggtg
61 ccttctgtag atgtgtgatg gcatataaac gaattcgtca acatattttt ttcgtcattt
121 tcaatctcct ataatgtaaa tgtattgttt aacgccgaaa catcatgatt gagcccattt
181 agccccgtgt gttggtctac ctttattcca atcataaaat cgatacttgt agaatgcagt
241 aatggatatt acggtaacaa ctgtaatatt acgtgcgggt attgcctaca aggaagtgta
301 tgcaatagac tgaatggaac ttgtgaaaat gggtgcatca accattttaa agaaccgaga
361 tgtgctgttt gtagagacgg attctacaac agtagatgca cttctcaatg cggcaagtgt
421 gtgaacgatg agccttgtga taaagtaaca ggagaatgca ggaatggatg tcaacaacat
481 tttgaacctc ctttgtgtca gggtttgtaa tgatatatat attgtgtact gatatgcacg
541 cttcaacgtg ttatatgata tattgcattc aaactgtctc atagcattga ccagcattgt
601 aaagtcaaaa ctcgcttttg cattattaat gtatttcacg taatgaattt cctgtataca
661 gttatgttaa cgtcttaaaa acacgcttta aactacacgt ccaaaataca tttatttgtg
721 acctgtgtaa gaatatagat aaaataagta ggtttcttct taaccattaa aaacaaaaca
781 atacaaaatc gtgtttaaac a


L2 ANSWER 7 OF 13 GENBANK.RTM. COPYRIGHT 2004 on STN

LOCUS (LOC):             CD646852 GenBank (R)
GenBank ACC. NO. (GBN):  CD646852
GenBank VERSION (VER):   CD646852.1 GI:31901304
CAS REGISTRY NO. (RN):   537503-80-3
SEQUENCE LENGTH (SQL):   572
MOLECULE TYPE (CI):      mRNA; linear
DIVISION CODE (CI):      Expressed sequence tag
DATE (DATE):             18 Jun 2003
DEFINITION (DEF):        AUF_106_F02_T7 Crassostrea virginica Gonad Crassostrea
                         virginica cDNA 5' similar to ***MEGF10*** protein,
                         mRNA sequence.
KEYWORDS (ST):           EST
SOURCE:                  Crassostrea virginica (eastern oyster)
ORGANISM (ORGN):         Crassostrea virginica
                         Eukaryota; Metazoa; Mollusca; Bivalvia; Pteriomorphia;
                         Ostreoida; Ostreoidea; Ostreidae; Crassostrea
NUCLEIC ACID COUNT (NA): 174 a 85 c 144 g 169 t
COMMENT:
  Contact: Liu ZJ
  The Fish Molecular Genetics and Biotechnology Laboratory,
  Department of Fisheries and Allied Aquacultures and Program of Cell
  and Molecular Biosciences
  Auburn University
  203 Swingle Hall, Auburn University, Auburn, AL 36849, USA
  Tel: 334 844 4054
  Fax: 334 844 9208
  Email: zliu@acesag.auburn.edu
  Seq primer: M13 Reverse.
REFERENCE:               1 (bases 1 to 572)
AUTHOR (AU):             Peatman,E.; Kucuktas,H.; Li,P.; He,C.; Feng,J.; Wei,X.;
                         Liu,Z.
TITLE (TI):              Differentially expressed oyster (Crassostrea virginica)
                         genes after exposure to mercury
JOURNAL (SO):            Unpublished

FEATURES (FEAT):
Feature Key   Location   Qualifier
source        1..572     /organism="Crassostrea virginica"
                         /mol-type="mRNA"
                         /db-xref="taxon:6565"
                         /clone-lib="Crassostrea virginica Gonad"
                         /note="organ: Gonad; Vector: pSPORT1; Site-1: NotI;
                         Site-2: SalI"

SEQUENCE (SEQ):
1 gtgatagtgg atgtaaaccg ggctggaagg gacaatactg tcacgaacaa tgtcaatatg
61 gcttctatgg acaggactgc ctacttacat gtaacgacaa atgtaacggt tgtaacaatg
121 tgaacggtta ctgtgataga ggatgtaaac caggctggaa gggggacaac tgtcaacaac
181 aatgcatgtc tgacttttat ggcgaaaact gtagcagaaa ttgtggatat tgcttagatg
241 gtagaacatg ccaccacata aatggcacat gtgaacaggg atgtaatccg ggttacaaag
301 ctccattctg taacgaagag tgtgattttg gattctacgg aaatagatgc cttcaggaat
361 gtggttcatt ttgcaaaata tcacgtgatt gtcatcattt gactggtttt tgtaaaaacg
421 gctgtaaaag tggttggcaa ggaaaggatt gttttgaagt ttcgaaactt gaagacagta
481 atacagattg gaaatcaaga ttttacggaa tgttgggggc gttcttcgta ttgttgattt
541 tgattggtct tcttattgcc taccatatta ta

GENBANK.RTM. COPYRIGHT 2004 on STN

LOCUS (LOC):             CD034532 GenBank (R)
GenBank ACC. NO. (GBN):  CD034532
GenBank VERSION (VER):   CD034532.1 GI:30416370
CAS REGISTRY NO. (RN):   514024-90-9
SEQUENCE LENGTH (SQL):   471
MOLECULE TYPE (CI):      mRNA; linear
DIVISION CODE (CI):      Expressed sequence tag
DATE (DATE):             7 May 2003
DEFINITION (DEF):        mgmt016xK13f.b Mated culture Magnaporthe grisea cDNA
                         clone mgmt016xK13 5', mRNA sequence.
KEYWORDS (ST):           EST
SOURCE:                  Magnaporthe grisea (anamorph: Pyricularia grisea)
ORGANISM (ORGN):         Magnaporthe grisea
                         Eukaryota; Fungi; Ascomycota; Pezizomycotina;
                         Sordariomycetes; Sordariomycetes incertae sedis;
                         Magnaporthaceae; Magnaporthe
NUCLEIC ACID COUNT (NA): 116 a 125 c 91 g 139 t
COMMENT:
  Contact: Ebbole DJ
  Department of Plant Pathology & Microbiology
  Texas A&M University
  Peterson Bldg, MS2132, College Station, TX 77843-2132, USA
  Tel: 979 845 4831
  Fax: 979 845 6483
  Email: d-ebbole@tamu.edu
  Chromatogram file of this sequence is available, see contact person
  ;Best nr hit (April. 22, 2003) ref|XP_236337.1| similar to ***MEGF10***
  protein [Homo sapiens] [Rattu... 33 1.2
  PCR PRimers
  FORWARD: T3 primer
  BACKWARD: T7 primer
  Plate: mgmt016 row: K column: 13
  Seq primer: T3.
REFERENCE:               1 (bases 1 to 471)
AUTHOR (AU):             Ebbole,D.J.; Yuan,J.; Thomas,T.L.; Bobrowicz,P.; Lu,
                         Bhatterai,K.; Dean,R.A.
TITLE (TI):              Expressed sequence tags from the rice blast fungus,
                         Magnaporthe grisea
JOURNAL (SO):            Unpublished (2002)

FEATURES (FEAT):
Feature Key   Location   Qualifier
source        1..471     /organism="Magnaporthe grisea"
                         /strain="4091-5-8 X 4136-4-3"
                         /db-xref="taxon:148305"
                         /clone="mgmt016xK13"
                         /clone-lib="Mated culture"
                         /sex="Mat1-2 and Mat1-1 mixted
                         culture"
                         /cell-type="mixed sexual
                         development"
                         /dev-stage="asci, ascospores,
                         perithecia, mycelium"
                         /note="Vector: pBluescriptSK;
                         Site-1: EcoRI; Site-2: XhoI; Two
                         mating types were co-cultivated
                         over a filter paper on oatmeal
                         agar medium. After three days at
                         25 C plates were transfered to 21
                         C. Perithecia with asci and
                         ascospores formed at the begining
                         of the third week. Material was
                         collected by scraping tissue from
                         the filter paper, sequences were
                         processed by one of two methods,
                         where a full-length alignment to
                         the M. grisea genome sequence was
                         available, the EST sequence was
                         trimed according to the alignment,
                         otherwise sequence quality was
                         assessed using phredPhrap version
                         991019 and trimmed according to
                         phd files (0.05) and for vector
                         seqs."

SEQUENCE (SEQ):
1 ggcttttctt tctttttcgt ttgaccagac ccaagccgca aatccatcca acgatatact
61 ttttcttctt tttttttctt ctcttttctt cgtcaactct caccgaatta ctgccgacat
121 aggcttctta gtactaggta cttctcttga aagcaggtac atcattctcg acccgaaacg
181 ccgaaaataa aatcgggatt tccgcgatcc gcgtttcgca ttttcaacga caaaccatta
241 ccaagaggac gactcggctt tatcaaaacc atcgcgctca tcctagaagt gctgctgttt
301 tctactacac caagaacact gcctcgctga atttattagc gaacgaacga agattctctg
361 gcatgtcggt catcacacag gcaacggcca tgttggccaa gcgagctgat ggatgtcccg
421 atggatacta ccggtctgga aactactgtt atcggggatc tggatggtac t

GENBANK.RTM. COPYRIGHT 2004 on STN

LOCUS (LOC):             BU493977 GenBank (R)
GenBank ACC. NO. (GBN):  BU493977
GenBank VERSION (VER):   BU493977.1 GI:22790235
CAS REGISTRY NO. (RN):   453296-65-6
SEQUENCE LENGTH (SQL):   257
MOLECULE TYPE (CI):      mRNA; linear
DIVISION CODE (CI):      Expressed sequence tag
DATE (DATE):             1 Oct 2002
DEFINITION (DEF):        vaa03f07.y2 SV DOGRDA RD3 Canis familiaris cDNA 5'
                         similar to gp|AAH20198.1|BC020198_1 Similar to
                         ***MEGF10*** protein [Homo ;, mRNA sequence.
SOURCE:                  dog.
ORGANISM (ORGN):         Canis familiaris
                         Eukaryota; Metazoa; Chordata; Craniata; Vertebrata;
                         Euteleostomi; Mammalia; Eutheria; Carnivora;
                         Fissipedia; Canidae; Canis
NUCLEIC ACID COUNT (NA): 67 a 65 c 60 g 65 t
COMMENT:
  Other_ESTs: vaa03f07.x2
  Contact: Skip Virgin
  RNA Expression in Diseased Tissues by RDA
  Washington University School of Medicine
  4444 Forest Park Parkway, Box 8501, St. Louis, MO 63108, USA
  Tel: 314 286 1800
  Fax: 314 286 1810
  Email: est@watson.wustl.edu
  Library was constructed by Tianxiang Shen and Skip Virgin DNA
  sequencing by: Washington University Genome Sequencing Center For
  information on obtaining a clone please contact: Skip Virgin
  (virgin@immunology.wustl.edu)
  Possible reversed clone: similarity on wrong strand
  Seq primer: -40RP from Gibco
REFERENCE:               1 (bases 1 to 257)
AUTHOR (AU):             Virgin,S.; Tianxiang,S.; Karst,S.; Wobus,C.; Lay,M.;
                         Clifton,S.; Pape,D.; Marra,M.; Hillier,L.; Martin,J.;
                         Wylie,T.; Dante,M.; Theising,B.; Bowers,Y.; Gibbons,M.;
                         Ritter,E.; Bennett,J.; Ronko,I.; Tsagareishvili,R.;
                         Maguire,L.; Kennedy,S.; Waterston,R.; Wilson,R.
TITLE (TI):              RNA Expression in Diseased Tissues by RDA
JOURNAL (SO):            Unpublished (2002)

FEATURES (FEAT):
Feature Key   Location   Qualifier
source        1..257     /organism="Canis familiaris"
                         /db-xref="taxon:9615"
                         /clone-lib="SV DOGRDA RD3"
                         /tissue-type="Brain from a dog
                         with encephalitis"
                         /lab-host="DH5a"
                         /note="Vector: pCR2.1-TOPO; PCR
                         products were directly cloned into
                         TA cloning vector (pre-cut vector
                         pCR2.1-TOPO was purchased from
                         invitrogen) Average insert size is
                         300bp. Our Insert also have same
                         adaptor sequence on both ends. The
                         sequence of the adaptor is:
                         5'-ATACGTGCAGGCTGGTTACATCTG-3'."

SEQUENCE (SEQ):
1 cacaggcact ggagcagttg gttccacccc agccaggctc acactgacag gtgtttggag
61 caatgcagcg accatggaca catttatcag cacagtgggg gacacacatt tccctgcttt
121 cataaaatcc aggacaacac tgggatttgc gcctatacat agttttctcc ccatgtcgat
181 aggcagtccg ataactgatt ctgtgccgtg tacatttaaa ccagttcaga atatcagtgc
241 agctggtgta gtagatt

LOCUS (LOC):             BU493770 GenBank (R)
GenBank ACC. NO. (GBN):  BU493770
GenBank VERSION (VER):   BU493770.1 GI:22790028
CAS REGISTRY NO. (RN):   453296-12-3
SEQUENCE LENGTH (SQL):   311
MOLECULE TYPE (CI):      mRNA; linear
DIVISION CODE (CI):      Expressed sequence tag
DATE (DATE):             1 Oct 2002
DEFINITION (DEF):        vaa03f07.x2 SV DOGRDA RD3 Canis familiaris cDNA 3'
                         similar to gp|BAB47409.1|AB058676_1 ***MEGF10***
                         protein ;, mRNA sequence.
SOURCE:                  dog.
ORGANISM (ORGN):         Canis familiaris
                         Eukaryota; Metazoa; Chordata; Craniata; Vertebrata;
                         Euteleostomi; Mammalia; Eutheria; Carnivora;
                         Fissipedia; Canidae; Canis
NUCLEIC ACID COUNT (NA): 82 a 71 c 75 g 82 t 1 others
COMMENT:
  Other_ESTs: vaa03f07.y2
  Contact: Skip Virgin
  RNA Expression in Diseased Tissues by RDA
  Washington University School of Medicine
  4444 Forest Park Parkway, Box 8501, St. Louis, MO 63108, USA
  Tel: 314 286 1800
  Fax: 314 286 1810
  Email: est@watson.wustl.edu
  Library was constructed by Tianxiang Shen and Skip Virgin DNA
  sequencing by: Washington University Genome Sequencing Center For
  information on obtaining a clone please contact: Skip Virgin
  (virgin@immunology.wustl.edu)
  Putative full length read
  vector to vector length is 312
  Possible reversed clone: similarity on wrong strand
  Seq primer: -40UP from Gibco.
REFERENCE:               1 (bases 1 to 311)
AUTHOR (AU):             Virgin,S.; Tianxiang,S.; Karst,S.; Wobus,C.; Lay,M.;
                         Clifton,S.; Pape,D.; Marra,M.; Hillier,L.; Martin,J.;
                         Wylie,T.; Dante,M.; Theising,B.; Bowers,Y.; Gibbons,M.;
                         Ritter,E.; Bennett,J.; Ronko,I.; Tsagareishvili,R.;
                         Maguire,L.; Kennedy,S.; Waterston,R.; Wilson,R.
TITLE (TI):              RNA Expression in Diseased Tissues by RDA
JOURNAL (SO):            Unpublished (2002)

FEATURES (FEAT):
Feature Key   Location   Qualifier
source        1..311     /organism="Canis familiaris"
                         /db-xref="taxon:9615"
                         /clone-lib="SV DOGRDA RD3"
                         /tissue-type="Brain from a dog
                         with encephalitis"
                         /lab-host="DH5a"
                         /note="Vector: pCR2.1-TOPO; PCR
                         products were directly cloned into
                         TA cloning vector (pre-cut vector
                         pCR2.1-TOPO was purchased from
                         invitrogen) Average insert size is
                         300bp. Our Insert also have same
                         adaptor sequence on both ends. The
                         sequence of the adaptor is:
                         5'-ATACGTGCAGGCTGGTTACATCTG-3'."

SEQUENCE (SEQ):
1 tgatcaaatc tactacacca gctgcactga tattctgaac tggtttaaat gtacacggca
61 cagaatcagt tatcggactg cctatcgaca tggggagaaa actatgtata ggcgcaaatc
121 ccagtgttgt cctggatttt atgaaagcag ggaaatgtgt gtcccccact gtgctgataa
181 atgtgtccat ggtcgctgca ttgctccaaa cacctgtcag tgtgagcctg gctggggtgg
241 aaccaactgc tccagtgcct gtgatggtga tcagatgtaa ccagcctgca cgtataatca
301 ctantgaatt c

LOCUS (LOC):             BM427205 GenBank (R)
GenBank ACC. NO. (GBN):  BM427205
GenBank VERSION (VER):   BM427205.1 GI:18432195
CAS REGISTRY NO. (RN):   396916-69-1
SEQUENCE LENGTH (SQL):   668
MOLECULE TYPE (CI):      mRNA; linear
DIVISION CODE (CI):      Expressed sequence tag
DATE (DATE):             30 Jan 2002
DEFINITION (DEF):        pgf2n.pk006.a15 Normalized Chicken Abdominal Fat
                         Library (pgf2n) Gallus gallus cDNA clone
                         pgf2n.pk006.a15 5' similar to gi|14192943|
                         ref|NP_115822.1| ***MEGF10*** protein [Homo
                         sapiens] gi|14724016|ref|XP_030163.1| ***MEGF10***
                         protein [Homo sapiens] dbj|BAB47409.1| (AB058676)
                         ***MEGF10*** protein (KIAA1780) [Homo sapiens], mRNA
                         sequence.
SOURCE:                  chicken.
ORGANISM (ORGN):         Gallus gallus
                         Eukaryota; Metazoa; Chordata; Craniata; Vertebrata;
                         Euteleostomi; Archosauria; Aves; Neognathae;
                         Galliformes; Phasianidae; Phasianinae; Gallus
NUCLEIC ACID COUNT (NA): 108 a 256 c 199 g 104 t 1 others
COMMENT:
  Contact: Larry A. Cogburn
  University of Delaware
  Townsend Hall, Newark, DE 19717, USA
  Tel: 302-831-1335
  Fax: 302-831-2822
  Email: cogburn@udel.edu, www.chickest.udel.edu.
REFERENCE:               1 (bases 1 to 668)
AUTHOR (AU):             Cogburn,L.A.; Morgan,R.; Burnside,J.
TITLE (TI):              ESTs from Normalized Chicken fat cDNA
                         library-USDA/IFAFS Animal Genome Project
JOURNAL (SO):            Unpublished (2002)

FEATURES (FEAT):
Feature Key   Location   Qualifier
source        1..668     /organism="Gallus gallus"
                         /strain="Commercial broiler,
                         Ottawa Research Centre strains,
                         commercial leghorn"
                         /db-xref="taxon:9031"
                         /clone="pgf2n.pk006.a15"
                         /clone-lib="Normalized Chicken
                         Abdominal Fat Library (pgf2n)"
                         /sex="Male and Female"
                         /tissue-type="Abdominal Fat"
                         /dev-stage="Embryonic (d18, d19);
                         post-hatch (d1,w3,w7,w9,w16,1yr)"
                         /lab-host="E. coli EMDH10B"
                         /note="Vector: pCMVSPORT6; Library
                         made from equivalent pools of
                         total RNA isolated from each
                         developmental age (across
                         strains); Single pass sequencing
                         from 5'-end"

SEQUENCE (SEQ): 

1 ccaccccagc gaccccaacg tctgcagcta ctgggagagt tttacagtag cagtgaagga 
61 gtcatacgcc aaaccccaca ttgtgacctc caccgagccc tgcactggga caccaggatt 
121 gccacagtcc tgcccgcagc aaagggtcgt gtaccgcacg gagtaccgac aggcggtgcg 
181 caccgactac cgccggcgct accagtgctg cctgggctac tatgagagca gggacgcctg 
241 cgtcccgcgc tgcacccagg agtgcgtcca cgggcggtgc gtggcccccg agcgttgcca 
301 gtgtgagccg ggctggaggg gacacgactg ctccagcgcg tgtgatgagc gtttgtgggg 
361 caagaactgc gagcaccact gtgactgcca ccacggggcc ccctgcaatc ccctgagtgg 
421 ggtctgtgcc tgcccccctg gcttcacccc cccactgtgc caccagccct gcccgcccgg 
481 ctcctacggc cccgactgcc gcctgcactg cccatgccac cacggggccc cctgcaacgc 
541 ctccactggt gcctgcctct gccccccagg acttgctggc ccactctgtg aggtgctgtg 
601 ccccgagggg acaccgtgcg gcacccactg cccctgccag aatgggggta tctgccaccc 
661 cccnggct 
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LOCUS (LOC) : BC020198 GenBank (R) 

GenBank acc. no. (gbn) : BC020198 

GenBank VERSION (VER) : BC020198.1 GI: 18044365 

CAS REGISTRY NO. (RN) : 381931-64-2 

SEQUENCE LENGTH (SQL): 2267 

MOLECULE TYPE (ci) : mRNA; linear 

DIVISION CODE (ci): Primates 



DATE (DATE): 
DEFINITION (DEF): 

SOURCE: 
ORGANISM (ORGN): 



protein, mRNA (cDNA clone 



16 Sep 2003 

Homo sapiens ***MEGF10*** 
IMAGE : 4904255) , complete cds . 
Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; chordata; craniata; Vertebrata; 
Euteleostomi ; Mammalia; Eutheria; Primates; Catarrhini; 
Hominidae; Homo 
543 a 548 c 631 g 545 t 



NUCLEIC ACID COUNT (NA) 
COMMENT: 

Contact: MGC help desk 

Email : cgapbs-rOmail .nih.gov 

Tissue Procurement: ATCC 

cDNA Library Preparation: Rubin Laboratory 

cDNA Library Arrayed by: The I.M.A.G.E. consortium (LLNL) 

DNA Sequencing by: Genome Sequence Centre,

BC Cancer Agency, Vancouver, BC, Canada

info@bcgsc.bc.ca

Steven Jones, Jennifer Asano, Ian Bosdet, Yaron Butterfield, 
Susanna Chan, Readman Chiu, Chris Fjell, Erin Garland, Ran Guin,
Letticia Hsiao, Martin Krzywinski, Reta Kutsche, Oliver Lee, Soo
Sen Lee, Victor Ling, Carrie Mathewson, Candice McLeavy, Steven
Ness, Pawan Pandoh, Anna-Liisa Prabhu, Parvaneh Saeedi, Jacqueline
Schein, Duane Smailus, Michael Smith, Lorraine Spence, Jeff Stott,
Michael Thorne, Miranada Tsai, Natasja van den Bosch, Jill Vardy,
George Yang, Scott Zuyderduyn, Marco Marra.

Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAL Plate: 40 Row: o Column: 4

This clone was selected for full length sequencing because it
passed the following selection criteria: matched mRNA gi: 14192942
This clone has the following problem: The cds is short compared to 
the longest cds in the locus 



REFERENCE :          1 (bases 1 to 2267)
AUTHOR (AU) :        Strausberg,R.L.; Feingold,E.A.; Grouse,L.H.;
                     Derge,J.G.; Klausner,R.D.; Collins,F.S.; Wagner,L.;
                     Shenmen,C.M.; Schuler,G.D.; Altschul,S.F.; Zeeberg,B.;
                     Buetow,K.H.; Schaefer,C.F.; Bhat,N.K.; Hopkins,R.F.;
                     Jordan,H.; Moore,T.; Max,S.I.; Wang,J.; Hsieh,F.;
                     Diatchenko,L.; Marusina,K.; Farmer,A.A.; Rubin,G.M.;
                     Hong,L.; Stapleton,M.; Soares,M.B.; Bonaldo,M.F.;
                     Casavant,T.L.; Scheetz,T.E.; Brownstein,M.J.;
                     Usdin,T.B.; Toshiyuki,S.; Carninci,P.; Prange,C.;
                     Raha,S.S.; Loquellano,N.A.; Peters,G.J.; Abramson,R.D.;
                     Mullahy,S.J.; Bosak,S.A.; McEwan,P.J.; McKernan,K.J.;
                     Malek,J.A.; Gunaratne,P.H.; Richards,S.; Worley,K.C.;
                     Hale,S.; Garcia,A.M.; Gay,L.J.; Hulyk,S.W.;
                     Villalon,D.K.; Muzny,D.M.; Sodergren,E.J.; Lu,X.;
                     Gibbs,R.A.; Fahey,J.; Helton,E.; Ketteman,M.; Madan,A.;
                     Rodrigues,S.; Sanchez,A.; Whiting,M.; Madan,A.;
                     Young,A.C.; Shevchenko,Y.; Bouffard,G.G.;
                     Blakesley,R.W.; Touchman,J.W.; Green,E.D.;
                     Dickson,M.C.; Rodriguez,A.C.; Grimwood,J.; Schmutz,J.;
                     Myers,R.M.; Butterfield,Y.S.; Krzywinski,M.I.;
                     Skalska,U.; Smailus,D.E.; Schnerch,A.; Schein,J.E.;
                     Jones,S.J.; Marra,M.A.
TITLE (TI) :         Generation and initial analysis of more than 15,000
                     full-length human and mouse cDNA sequences
JOURNAL (SO) :       Proc. Natl. Acad. Sci. U.S.A., 99 (26), 16899-16903
                     (2002)
OTHER SOURCE (OS) :  CA 138:84323
REFERENCE :          2 (bases 1 to 2267)
AUTHOR (AU) :        Strausberg,R.
TITLE (TI) :         Direct Submission
JOURNAL (SO) :       Submitted (19-DEC-2001) National Institutes of Health,
                     Mammalian Gene Collection (MGC), Cancer Genomics
                     Office, National Cancer Institute, 31 Center Drive,
                     Room 11A03, Bethesda, MD 20892-2590, USA



FEATURES (FEAT):

  Feature Key     Location        Qualifier
  ===============+===============+================================
  source          1..2267         /organism="Homo sapiens"
                                  /mol-type="mRNA"
                                  /db-xref="taxon:9606"
                                  /clone="IMAGE:4904255"
                                  /tissue-type="Muscle,
                                  rhabdomyosarcoma"
                                  /clone-lib="NIH-MGC-17"
                                  /lab-host="DH10B-R"
                                  /note="Vector: pOTB7"
  gene            1..2267         /gene="MEGF10"
                                  /note="synonym: KIAA1780"
                                  /db-xref="LocusID:84466"
  CDS             280..1983       /codon-start=1
                                  /product="MEGF10 protein"
                                  /protein-id="AAH20198.1"
                                  /db-xref="GI:18044366"
                                  /db-xref="LocusID:84466"
                                  /translation="MVISLNSCLSFICLLLCHWIGTASPLNLEDPNVCSHWESYSVTV
                                  QESYPHPFDQIYYTSCTDILNWFKCTRHRVSYRTAYRHGEKTMYRRKSQCCPGFYESG
                                  EMCVPHCADKCVHGRCIAPNTCQCEPGWGGTNCSSACDGDHWGPHCTSRCQCKNGALC
                                  NPITGACHCAAGFRGWRCEDRCEQGTYGNDCHQRCQCQNGATCDHVTGECRCPPGYTG
                                  AFCEDLCPPGKHGPQCEQRCPCQNGGVCHHVTGECSCPSGWMGTVCGQPCPEGRFGKN
                                  CSQECQCHNGGTCDAATGQCHCSPGYTGERCQDECPVGTYGVLCAETCQCVNGGKCYH
                                  VSGACLCEAGFAGERCEARLCPEGLYGIKCDKRCPCHLENTHSCHPMSGECACKPGWS
                                  GLYCNETCSPGFYGEACQQICSCQNGADCDSVTGKCTCAPGFKGIDCSTPCPLGTYGI
                                  NCSSRCGCKNDAVCSPVDGSCTCKAGWHGVDCSIRCPSGTWGFGCNLTCQCLNGGACN
                                  TLDGTCTCAPGWRGEKCELPCQDGTYGLNCAERCDCSHADGCHPTTGHCRCLPGWSGL
                                  F"
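The CDS location (280..1983) and the start of the /translation qualifier can be cross-checked against the nucleotide sequence of this record. The following is a minimal Python sketch, not part of the database output: the helper names (`translate`, `cds_length_check`) are hypothetical, the codon table is the standard genetic code, and the input is the sequence line covering bases 241-300 copied from this record's SEQUENCE field.

```python
# Standard genetic code (NCBI translation table 1), bases ordered t, c, a, g.
BASES = "tcag"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate DNA codon by codon; 'X' marks codons with ambiguous bases."""
    dna = dna.lower()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, usable, 3))


def cds_length_check(start, end):
    """Return (nucleotides, codons) for a 1-based inclusive CDS location."""
    n = end - start + 1
    if n % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return n, n // 3


# Bases 241-300 of record BC020198, as printed in its SEQUENCE field.
LINE_START = 241
line_241 = "aagcaagcgg atttcagaga ggttgttctt cagaaaaaaa tggttatttc tttgaactca".replace(" ", "")

CDS_START, CDS_END = 280, 1983
nucleotides, codons = cds_length_check(CDS_START, CDS_END)
peptide_start = translate(line_241[CDS_START - LINE_START:])

print(nucleotides, codons)  # 1704 568 (567 residues plus the stop codon)
print(peptide_start)        # MVISLNS
```

The seven residues obtained this way agree with the first residues of the /translation qualifier above.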

SEQUENCE (SEQ) : 

1 aaactgggcg actgggaacg cgttgagacg ttcctctttc ccgcttctcc acctttacgc 
61 ctgaaagaag actcccaaag attgctttct tctgggacgc tgcttgaacg ctaaccgcgt 
121 tgattggaac agattttgtg tcttggctgg ctttgggtga agaccgggga gaaaaggttg 
181 cgctgcgatt ctcaagatct ctggacctgg attatcattg caaatccatt aaagaaggag 
241 aagcaagcgg atttcagaga ggttgttctt cagaaaaaaa tggttatttc tttgaactca 
301 tgcctgagct ttatttgttt attgttatgc cactggattg ggacagcatc acctctgaat 
361 cttgaagacc ctaatgtgtg tagccactgg gaaagctact cagtgactgt gcaagagtca 
421 tacccacatc cctttgatca aatttactac acgagctgca ctgacattct aaactggttt 
481 aaatgcacgc ggcacagagt cagctatcgg acagcctatc gacatgggga gaagactatg 
541 tataggcgca agtctcagtg ttgtcctgga ttttatgaaa gcggggaaat gtgtgtcccc 
601 cactgtgctg ataaatgtgt ccatggtcgc tgtattgctc caaacacctg tcagtgtgag 
661 cctggctggg gagggaccaa ctgctccagt gcctgcgatg gtgatcactg gggtccccac 
721 tgcaccagcc ggtgccagtg caaaaatggg gctctgtgca accccatcac cggggcttgc 
781 cactgtgctg cgggcttccg gggctggcgc tgcgaggacc gctgtgagca gggcacctat 
841 ggtaacgact gtcatcagag atgccagtgc cagaatggag ccacctgcga ccacgtcacg 
901 ggggaatgcc gctgcccacc aggatacacc ggagccttct gtgaggatct ttgtcctcct 
961 ggtaaacatg gtccacagtg tgagcagaga tgcccttgtc aaaatggagg agtgtgtcat 
1021 cacgtcactg gagaatgctc ttgcccttct ggctggatgg gcacagtgtg tggtcagcct 
1081 tgccccgagg gtcgctttgg aaagaactgt tcccaagaat gccagtgcca taatggaggg 
1141 acgtgtgatg ctgccacagg ccaatgtcat tgcagtccag gatacacagg ggaacggtgc 
1201 caggatgagt gtcctgttgg gacctatggc gttctctgtg ctgagacctg ccagtgtgtc 
1261 aacggaggga agtgttacca cgtgagcggc gcatgcctct gtgaagcagg ctttgctggc 
1321 gagcgctgcg aagcacgcct gtgtcctgag gggctctacg gcatcaaatg tgacaaacgg 
1381 tgtccctgcc acttggaaaa cactcatagc tgtcacccca tgtctggaga gtgtgcctgc 
1441 aagccgggct ggtcaggact ctactgtaat gagacatgtt ctcctggatt ctacggggaa 
1501 gcttgccagc agatctgcag ctgccaaaat ggggcagact gtgacagtgt gactggaaag 
1561 tgcacctgtg ccccaggatt caaaggaatt gactgctcta ccccatgccc tctgggaacc 
1621 tatgggataa actgttcctc tcgctgtggc tgtaaaaatg atgcagtctg ctctcctgtg 
1681 gacgggtctt gtacttgcaa ggcaggctgg cacggggtgg actgctccat cagatgtccc 
1741 agtggcacat ggggctttgg ctgtaactta acatgccagt gcctcaacgg gggagcctgc 
1801 aacaccctgg acgggacctg cacgtgtgca cctggatggc gcggggagaa atgcgaactt 
1861 ccctgccagg atggcacgta cgggctgaac tgtgctgagc gctgcgactg cagccacgca 
1921 gatggctgcc accctaccac gggccattgc cgctgcctcc ccggatggtc aggattgttt 
1981 tgactattct gggtccttga atttccatgt gaatggtaac atcaatatgt taatttccac 
2041 aaagaagcaa ggtggaaatt taatggagat tgaatctata gatcaatttg gggaatactg 
2101 tccacttaac aatatgaagt cttcagattc ataaacatgg gatgtcacca gcaacctctt 
2161 cagtagtgtc tctcagcccc ttggaaaatg ttgagagctt ctttggtttc taaaggagta 
2221 aactgagatg aactgtcaaa aaaaaaaaaa aaaaaaaaaa aaaaaaa 
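The numbered sequence lines can be checked against the SEQUENCE LENGTH and NUCLEIC ACID COUNT fields of the record. The sketch below is illustrative only and assumes the layout shown above (a leading position number followed by space-separated blocks of bases); the function name `parse_sequence_lines` is hypothetical. It uses the last two lines of this record as input.

```python
from collections import Counter


def parse_sequence_lines(lines):
    """Join flat-file sequence lines, dropping position numbers and spaces."""
    chunks = []
    for line in lines:
        fields = line.split()
        if fields and fields[0].isdigit():
            fields = fields[1:]  # discard the leading position number
        chunks.extend(fields)
    return "".join(chunks).lower()


# Last two sequence lines of record BC020198, copied from the listing above.
last_lines = [
    "2161 cagtagtgtc tctcagcccc ttggaaaatg ttgagagctt ctttggtttc taaaggagta",
    "2221 aactgagatg aactgtcaaa aaaaaaaaaa aaaaaaaaaa aaaaaaa",
]
tail = parse_sequence_lines(last_lines)

# The final base position should equal the declared SEQUENCE LENGTH (2267).
final_position = 2161 + len(tail) - 1
print(final_position)  # 2267

# The declared NUCLEIC ACID COUNT should sum to the same length.
declared = {"a": 543, "c": 548, "g": 631, "t": 545}
print(sum(declared.values()))  # 2267

# Base composition of the parsed fragment; on the full sequence this
# tally could be compared with the declared counts directly.
composition = Counter(tail)
```

Running the same parser over all sequence lines of the record and comparing `Counter` output with the declared counts would be one way to catch OCR damage in the sequence itself.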
/organism="Homo sapiens"
/db-xref="taxon:9606"
/clone="pf01012"
/tissue-type="hippocampus"
/dev-stage="adult"
/codon-start=1
/product="MEGF10 protein (KIAA1780)"
/protein-id="BAB47409.1"
/db-xref="GI:14017777"
/translation="MVISLNSCLSFICLLLCHWIGTASPLNLEDPNVCSHWESYSVTV
QESYPHPFDQIYYTSCTDILNWFKCTRHRVSYRTAYRHGEKTMYRRKSQCCPGFYESG
EMCVPHCADKCVHGRCIAPNTCQCEPGWGGTNCSSACDGDHWGPHCTSRCQCKNGALC
NPITGACHCAAGFRGWRCEDRCEQGTYGNDCHQRCQCQNGATCDHVTGECRCPPGYTG
AFCEDLCPPGKHGPQCEQRCPCQNGGVCHHVTGECSCPSGWMGTVCGQPCPEGRFGKN
CSQECQCHNGGTCDAATGQCHCSPGYTGERCQDECPVGTYGVLCAETCQCVNGGKCYH
VSGACLCEAGFAGERCEARLCPEGLYGIKCDKRCPCHLENTHSCHPMSGECACKPGWS
GLYCNETCSPGFYGEACQQICSCQNGADCDSVTGKCTCAPGFKGIDCSTPCPLGTYGI
NCSSRCGCKNDAVCSPVDGSCTCKAGWHGVDCSIRCPSGTWGFGCNLTCQCLNGGACN
TLDGTCTCAPGWRGEKCELPCQDGTYGLNCAERCDCSHADGCHPTTGHCRCLPGWSGV
HCDSVCAEGRWGPNCSLPCYCKNGASCSPDDGICECAPGFRGTTCQRICSPGFYGHRC
SQTCPQCVHSSGPCHHITGLCDCLPGFTGALCNEVCPSGRFGKNCAGICTCTNNGTCN
PIDRSCQCYPGWIGSDCSQPCPPAHWGPNCIHTCNCHNGAFCSAYDGECKCTPGWTGL
YCTQRCPLGFYGKDCALICQCQNGADCDHISGQCTCRTGFMGRHCEQKCPSGTYGYGC
RQICDCLNNSTCDHITGTCYCSPGWKGARCDQAGVIIVGNLNSLSRTSTALPADSYQI
GAIAGIIILVLVVLFLLALFIIYRHKQKGKESSMPAVTYTPAMRVVNADYTISGTLPH
SNGGNANSHYFTNPSYHTLTQCATSPHVNNRDRMTVTKSKNNQLFVNLKNVNPGKRGP
VGDCTGTLPADWKHGGYLNELGAFGLDRSYMGKSLKDLGKNSEYNSSNCSLSSSENPY
ATIKDPPVLIPKSSECGYVEMKSPARRDSPYAEINNSTSANRNVYEVEPTVSVVQGVF
SNNGRLSQDPYDLPKNSHIPCHYDLLPVRDSSSSPKQEDSGGSSSNSSSSSE"

SEQUENCE (SEQ): 

1 aaagattgct ttcttctggg acgctgcttg aacgctaacc gcgttgattg gaacagattt 
61 tgtgtcttgg ctggctttgg gtgaagaccg gggagaaaag gttgcgctgc gattctcaag 
121 atctctggac ctggattatc attgcaaatc cattaaagaa ggagaagcaa gcggatttca 
181 gagaggttgt tcttcagaaa aaaatggtta tttctttgaa ctcatgcctg agctttattt 
241 gtttattgtt atgccactgg attgggacag catcacctct gaatcttgaa gaccctaatg 
301 tgtgtagcca ctgggaaagc tactcagtga ctgtgcaaga gtcataccca catccctttg 
361 atcaaattta ctacacgagc tgcactgaca ttctaaactg gtttaaatgc acgcggcaca 
421 gagtcagcta tcggacagcc tatcgacatg gggagaagac tatgtatagg cgcaagtctc 
481 agtgttgtcc tggattttat gaaagcgggg aaatgtgtgt cccccactgt gctgataaat 
541 gtgtccatgg tcgctgtatt gctccaaaca cctgtcagtg tgagcctggc tggggaggga 
601 ccaactgctc cagtgcctgc gatggtgatc actggggtcc ccactgcacc agccggtgcc 
661 agtgcaaaaa tggggctctg tgcaacccca tcaccggggc ttgccactgt gctgcgggct 
721 tccggggctg gcgctgcgag gaccgctgtg agcagggcac ctatggtaac gactgtcatc 
781 agagatgcca gtgccagaat ggagccacct gcgaccacgt cacgggggaa tgccgctgcc 
841 caccaggata caccggagcc ttctgtgagg atctttgtcc tcctggtaaa catggtccac 
901 agtgtgagca gagatgccct tgtcaaaatg gaggagtgtg tcatcacgtc actggagaat 
961 gctcttgccc ttctggctgg atgggcacag tgtgtggtca gccttgcccc gagggtcgct 
1021 ttggaaagaa ctgttcccaa gaatgccagt gccataatgg agggacgtgt gatgctgcca 
1081 caggccaatg tcattgcagt ccaggataca caggggaacg gtgccaggat gagtgtcctg 
1141 ttgggaccta tggcgttctc tgtgctgaga cctgccagtg tgtcaacgga gggaagtgtt 
1201 accacgtgag cggcgcatgc ctctgtgaag caggctttgc tggcgagcgc tgcgaagcac 
1261 gcctgtgtcc tgaggggctc tacggcatca aatgtgacaa acggtgtccc tgccacttgg 
1321 aaaacactca tagctgtcac cccatgtctg gagagtgtgc ctgcaagccg ggctggtcag 
1381 gactctactg taatgagaca tgttctcctg gattctacgg ggaagcttgc cagcagatct 
1441 gcagctgcca aaatggggca gactgtgaca gtgtgactgg aaagtgcacc tgtgccccag 
1501 gattcaaagg aattgactgc tctaccccat gccctctggg aacctatggg ataaactgtt 
1561 cctctcgctg tggctgtaaa aatgatgcag tctgctctcc tgtggacggg tcttgtactt 
1621 gcaaggcagg ctggcacggg gtggactgct ccatcagatg tcccagtggc acatggggct 
1681 ttggctgtaa cttaacatgc cagtgcctca acgggggagc ctgcaacacc ctggacggga 
1741 cctgcacgtg tgcacctgga tggcgcgggg agaaatgcga acttccctgc caggatggca 
1801 cgtacgggct gaactgtgct gagcgctgcg actgcagcca cgcagatggc tgccacccta 
1861 ccacgggcca ttgccgctgc ctccccggat ggtcaggtgt ccactgtgac agcgtgtgtg 
1921 ctgagggacg ctggggcccc aactgctccc tgccctgcta ctgtaaaaat ggggcttcat 
1981 gctcccctga tgatggcatc tgcgagtgtg caccaggctt ccgaggcacc acttgtcaga 
2041 ggatctgctc ccctggtttt tatgggcatc gctgcagcca gacatgccca cagtgcgttc 
2101 acagcagcgg gccctgccac cacatcaccg gcctgtgtga ctgcttgcct ggcttcacag 
2161 gcgccctctg caatgaagtg tgtcccagtg gcagatttgg gaaaaactgt gcaggaattt 
2221 gtacctgcac caacaacgga acctgtaacc ccattgacag atcttgtcag tgttaccccg 
2281 gttggattgg cagtgactgc tctcaaccat gtccacctgc ccactggggc ccaaactgca 
2341 tccacacgtg caactgccat aatggagctt tctgcagcgc ctacgatggg gaatgtaaat 
2401 gcactcctgg ctggacaggg ctctactgca ctcagagatg tcctctaggg ttttatggaa 
2461 aagattgtgc actgatatgc caatgtcaaa acggagctga ctgcgaccac atttctgggc 
2521 agtgtacttg ccgcactgga ttcatgggac ggcactgtga gcagaagtgc ccttcaggaa 
2581 catatggcta tggctgtcgc cagatatgtg attgtctgaa caactccacc tgcgaccaca 
2641 tcactgggac ctgttactgc agccccggat ggaagggagc gagatgtgat caagctggtg 
2701 ttatcatagt tggaaatctg aacagcttaa gccgaaccag tactgctctc cctgctgatt 
2761 cctaccagat cggggccatt gcaggcatca tcattcttgt cctagttgtt ctcttcctac 
2821 tggcattgtt cattatttat agacacaagc agaagggaaa ggaatcaagc atgccagcag 
2881 ttacctacac ccctgctatg agggtcgtca atgcagatta taccatttca ggaacccttc 
2941 ctcacagcaa tggtggaaac gctaatagcc actacttcac caatcccagt taccacacgc 
3001 tcacccagtg tgccacatcc cctcacgtca acaacaggga caggatgact gtcacgaagt 
3061 caaaaaacaa tcaactgttt gtgaatctta aaaatgtgaa ccctgggaag agaggccctg 
3121 tgggggactg cactgggaca ttgccggctg actggaaaca tggcggctac ctcaacgagc 



3181 tcggtgcttt tggacttgac agaagctata tgggaaaatc cttaaaagac ctgggaaaga 
3241 attctgaata taattcaagt aactgctccc taagcagttc tgagaaccca tatgccacta 
3301 ttaaagaccc acctgtactt atcccgaaaa gctcagagtg tggttatgtg gagatgaaat 
3361 cgccggcacg aagagattcc ccatatgcag agatcaataa ctcaacttca gccaacagga 
3421 atgtctatga agttgaacct acagtgagtg ttgtccaagg agtattcagc aataatgggc 
3481 gtctctccca ggatccatat gacctcccaa agaacagtca catcccttgt cattatgacc 
3541 tgctgccagt ccgagacagt tcatcctccc ctaagcaaga ggacagtgga ggtagcagca 
3601 gcaacagcag cagcagcagt gaatgacacc aaaggaccgc ttggtagcca ctggaaccct 
3661 ttccagaact gctgtttggt tcttctccat cctcaatttt gccactttca tgtgaatgtt 
3721 agtcaattcg gtgggcaatt tttggacatg aaccagaaag ctgaaagctg aggctgacac 
3781 ggactgtagg tgctttttgt tcaggtggat tcgaaggagt tagagatgtg atttgccatt 
3841 gctgttagtt ttagaactat acccgtgaag catgacttat tgtaagatgt tggctgaaag 
3901 catgaacttg cagaactccc tcggagacgc aggttgcagt ggacattggg attgttgctt 
3961 gaaaaattaa aatttgaata ttttctctct catttgcatc atacagctct acctaggatt 
4021 gtacagttta ccataaaatt tacttcatga aagtgggaat cactgaacat gtagaagaca 
4081 aggaacatat tgttaactcc tgattcttaa ctttattcaa ctggactcag aattgtaggg 
4141 ataatatgaa tgcaggagga aacattctgt caggcggtat gactggacag actttgaata 
4201 tactctaaaa gtggacagaa aatttacgaa aatcttagat tttgtttaga atgagaaaat 
4261 atacaattag aattatttta gaaatagtag gaagtattgc agaagtcaat acacaaatgt 
4321 gccaggcaga ggtggttttc tctgtttgac tctcaaccaa cttcagatct atgacattat 
4381 tctgatcact ggctccatca tacatattca ccacttgaga ttcataacat atcaatagtt 
4441 atttcataaa tatagaaatg aaataatttt atttttgaca gactggatgg aatgagtgtg 
4501 taatgattga taaaggttgt aaattttaaa tgcaagatga cgcttacgtt ctgtaaacca 
4561 ttagtaatac atgctgtaat atagaattag tggaacattt tgattaatct ttccctagaa 
4621 gtgactgaaa tatttttgtg catatttgag aaagggaact ttccttttat taattgtcaa 
4681 tttagagaaa ctatgcttaa gctggtcttt tgcattgcta atgtgacatg tacccaactt 
4741 ttcattaatt tgtatttcca tttttaaatt gcatattcta tgttttgtag tgtttggatt 
4801 gttaatgaaa aaatattata tgttcgttat tccttgtatt attgccactt atcttttgct 
4861 tgataaaaat gcgttgttct tttttctttt ggagggacaa gatgaaaata tataatttga 
4921 attgattaaa attggtcgtt actaaaatag tatagtaacc acaagtgatt ggcttataaa 
4981 tgaagtagaa atgcttttta atattccaaa atagagttcc ttttgatctg ttggtgctga 
5041 gccttggtta aaccagggag aaggggagca gaaaggaaac gttgttactg atgagtacca 
5101 cagactcatc ttaaaaaaaa ctctcattat ggtgatcata gaattgacca tccaaactgg 
5161 gacactcttg agagtaaatg gagggcatta ttaataatta tcttgtaatg aacttaaatc 
5221 tggactgttc caggcaaacc agacttatct tgcaatatga gaatgctgac acaatgcagg 
5281 aaagccagtt tcccttttgt tgatctactt gaccaagcaa aggggctgaa aaactgaata 
5341 aggaaacaac tttataagag aaacagtggt cttcaatctt ttaaagacat gaaatcctat 
5401 atggcattct gtctcagtga gtcagttaac aaatacgtat gtgcaaccct tctgctagta 
5461 gtgcacataa gtgattatcc ctgccaggta tcgagttgga atatccagtt atttcatgtc 
5521 acacatcggc acgtatgatg gtggtttggt cagatggata atacagcaga cacacttaga 
5581 acactcactg cactggtggc tgttcatttt gaggaactcc aaagtcaatt caaggaaata 
5641 aggaatgacc tggaacaggc cttagaaaca attgatttat tccaatagtt aaccactggc 
5701 tggctcccaa ctctaggtga taggcatcta attgagacat gtgtgagtca atagccatcg 
5761 gggtcctttt tggtgagaag aaataagaca ttttctccct tttaaagatc atctcttcag 
5821 agctgcatgg taaatttgaa tttccccatt gttctctaga gataggtcaa aaaaacaact 
5881 gtttcttttc ttattccact attaataaat tgaaagcttg ctgtacaaag gcaacagcaa 
5941 cttaaaaaag aaaatctgga caatagatgt ggactccaag gggccactgc catcccttcc 
6001 tgtgtgctct ttttgacaag taaatcactc cagttagctg atgtcctgaa atgacatttg 
6061 tacctgaggc aattcttatt agagcttact caggactttt caaacatcag gacacacata 
6121 gcaatttgga tgttgtaaaa ctacgtgtta catttggaga ggtcttgttg agaaggcaaa 
6181 attttctaaa tattttgata tcaatttggg taaagaaagg aatacttttg ttagaatgag 
6241 aattaaagag aaaacaaaag aatcttggca aaattttccc tctgaaatta caaactttag 
6301 gaacttttcc agatgtatcg aatttaattt gacattgatg tcactcttca taccaagtga 
6361 atctattctc atgaactttg aacccatgtt aattttggac cctataacta atttcctttc 
6421 tgtatcccat ggtggtcaaa gcagtaaagg agtttagaga tacttagcaa atgccattca 
6481 tatagtattg ttagccaaac tattattcct tccttagaag ttgatgtatc agaaaattta 
6541 taagttattt gtatttatat ataagtacat cagaacttgc ccacatgata actcatgttt 
6601 gctttaataa ggaaaatatg ttatttgagg ttaattatat gatagtgagg aacattttac 
6661 aattatttac agtgtttgac attaattgct gtcctatgat acagtagtaa tgttaacaga 
6721 gagtttaata tgtttagcat tatctttatg gaatttatat tcaccaattt caggaaaacc 
6781 aaccatagaa ctttataata ggacagttat aacctgactg aaccaaatcc atcagcataa 
6841 tgactgtgtg ttctgagaaa tgcaaacaac tttatgaaag tgtggtcatc ttgaccccca 
6901 aagagccaca gtagctgtca attattgaga tttttaaatg gtaaatattg caaaagttaa 
6961 acaagggtac cacaatatca tttataattg atgtcaatat tagtagtcca ctttatgctt 
7021 aaaataaaat cataaaaaat caagttttaa ggttatattt tttaaaataa tgaattttct 
7081 atctctaggc aacatgtctt tattattcaa gctgaatgtt taagagagat tttggtctta 
7141 aaggcttcat catgaaagtg tacatgcata tgcaagtgtg aattacgtgg tatggatggt 
7201 tgcttgttta ttaactaaag atgtacagca aactgcccgt ttagagtcct gttaatattg 
7261 atgtcctaac actgggtctg cttatgctag ttcagtattt ggtctaggct tcattttgat 
7321 atgactgaat tgcaatctat atttttaaaa agaaaaatca aaatacatta gggtttactt 
7381 ggttgtggca aacaaacaaa aaatgctttc agtgttgaaa tatctctatt ttccaaagat 
7441 gataaacttt catcgttctt agcctacatt gtcatttata tcaccaaatg agtttataga 
7501 ttaatgcaat aaactttcta at 